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INTERNATIONAL PRELIMINARY 

EXAMINATION REPORT International application No. PCT/NO00/00155 



I. Basis of the report 

1 . With regard to the elements of the international application (Replacement sheets which have been furnished to 
the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally filed" 
and are not annexed to this report since they do not contain amendments (Rules 70. 16 and 70. 17)): 
Description, pages: 

1-11 as received on 1 6/07/2001 with letter of 1 1/07/2001 

Claims, No.: 

1 -9 as received on 1 6/07/2001 with letter of 1 1/07/2001 

Drawings, sheets: 

1-6 as originally filed 



2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1(b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 
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□ the drawings, sheets: 

5. H This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

see separate sheet 

6. Additional observations, if necessary: 



V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement 

Novelty (N) Yes: Claims 1 -9 

No: Claims 

Inventive step (IS) Yes: Claims 1-9 

No: Claims 

Industrial applicability (IA) Yes: Claims 1-9 

No: Claims 



2. Citations and explanations 
see separate sheet 
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Re Item I 

Basis of the report 

(i) The feature of the amended claim 1 that 

the number of search nodes (N 6 ) in each search node column (S) scales with 
estimated or expected traffic load, thus implementing a scaling of performance, 

although being obvious, is not disclosed in the application as originally filed, where only 
is said (s. originally filed claim 1, last paragraph, and claim 10) that 

a scaling of performance takes place through replication of one or more partitions d, 
being implemented by an increase of the number of search nodes (N B ) in each search 
node column (S). 

(ii) The feature that 

scaling of performance takes place through replication of one or more partitions d, 

as is explicitly stated in the originally filed claim 1 and is valid for the whole application 
as originally filed, is no more explicitly stated in the amended claim 1 , thus allowing the 
interpretation that 

in any case searching the document collection D takes place with a single search 
node (N B ) holding by reference one of the partition-dependent data sets d Pjk and all 
search nodes (N B ) in a search node column (S B ) containing by reference identical 
partition-dependent data sets d Ptk , 
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which goes beyond the disclosure of the application as originally filed. 
Re Item V 

Reasoned statement under Art. 35(2) with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 

Reference is made to the following document: 

D1 : Erhard Rahm, Mehrrechner-Datenbanksysteme, Addison-Wesley, 1994, p. 
34,35,62-66,326-328 

The document D1was not cited in the international search report. A copy of the 
document was sent to the applicant. 

1 . The subject-matter of claim 1 is new and inventive in the sense of Articles 33(2) and 
33(3) PCT. 

State of the Art 

It is straightforward to implement an Internet search machine by setting up a database 
relation, in which each tupel comprises a keyword and the URL of a document containing 
the keyword. For storage and performance reasons it may be necessary to fragment the 
relation and to assign the fragments to different workstations, each corresponding to a 
search node column of the application, in a parallel shared-nothing database system 
(see D1, page 34, lines 17-24, page 35, lines 7-24). In order to have concurrent 
execution of requests as "return all URLs of documents containing keyword 1 and 
keyword 2" with minimum communication among the workstations, it is obvious to 
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horizontally fragment in respect to ranges or hash function results of the URLs, i.e. each 
fragment, and thus workstation, corresponds to a particular partition of the URLs, or 
documents (see D1, page 326-page 328, line 22, chapters 5.3.1 and 5.3.3). 

In this system 

the number v of search node columns (S) scales with the number n of partitions d, 
thus implementing a scaling of the data volume. 

It may happen that a maximum number of partition or fragment ranges, that is feasible, is 
reached or that a fragmentation may not be changed easily. In such a situation, in order 
to further improve performance it is the most obvious to realize the workstations to which 
the partitions are assigned as high-end multiprocessor workstations which already are 
used for Internet search engines, whereby each processor corresponds to a search node 
of the application. Then, 

a scaling of performance is implemented by an increase of the number of search 
nodes (N„), i.e. processors, in each search node column (S), i.e. multiprocessor 
workstation, 

but this takes place without replication of one or more partitions d. Thus, in this case 
searching the document collection D takes place with a single search node (N 6 ) merely 
having access to, i.e holding by reference only, the partition-dependent data set d pM 
stored on the respective multiprocessor workstation, i.e. all search nodes, or processors, 
in a search node column (S B ) are containing by reference only identical 
partition-dependent data sets d Pik . 

Problem 
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The use of high-end multiprocessor workstations may be very expensive, and they are 
scaleable only within limits. 

Solution 

The application proposes that 

in any case searching the document collection D takes place with a single search 
node (N„) holding one of the partition-dependent data sets d Pik and all search nodes 
(N B ) in a search node column (S„) containing identical partition-dependent data sets 

d P,k. 

whereby partitions are replicated, as is explicitly stated in the originally filed claim 1 . 

This allows alternative architectures as e.g. to realize search nodes by modules that 
hold replicated partitions on local memory, and hereby exploits the fact that no 
communication between search nodes is necessary. 

2. Claims 2-9 are dependent on claim 1 and as such also meet the requirements of 
the PCT with respect to novelty and inventive step. 



PCT/Separate Sheet/409 (Sheet 4) (EPO-April 1 997) 



A search engine with two-dimensional linearly scalable parallel 
architecture. 

The present invention concerns a search engine with two-dimensional 
linearly scalable parallel architecture for searching a collection of text 
documents D, wherein the documents can be divided into a number of 
partitions d h d 2 ,...d n , wherein the collection of documents D is preprocessed 
in a text filtration system such that a preprocessed document collection D p is 
obtained and corresponding preprocessed partitions d pl , d p2 ,... d pn , wherein an 
index / can be generated from the document collection D such that for each 
previous preprocessed partition d ph d p2 ,... d pn a corresponding index i h i 2 ,~. 
i„ is obtained, wherein searching a partition d of the document collection D 
takes place with a partition-dependent data set d Pik comprising both the 
preprocessed partition d pk and the corresponding index i h with l<k<n, and 
wherein the search engine comprises data processing units which form sets of 
nodes connected in a network. 

Most prior art search engines work with large data set and employ powerful 
computers to perform the search. However, searching is a partitionable data 
processing problem, and this fact can be used to partition a search problem 
into a large number of specific queries and let each query be processed 
simultaneously on a commensurate number of processors connected in 
parallel in a network. Particularly searching can be regarded as a binary 
partitionable data processing problem, and hence a binary tree network is 
used for establishing a multiprocessor architecture such as disclosed for 
instance in US patent No. 4860201 (Stolfo & al.) and international patent 
application PCT/N09 9/00308 which belongs to the applicant and hereby is 
incorporated by reference. The present applicant has developed proprietary 
technologies for searching within regular text documents. These technologies 
are i.a. based on a search system and a method for searching as described in 
international patent application PCT/NO99/00233 which belongs to the 
applicant and hereby is incorporated by reference. The search system is based 
on efficient core search algorithms which may be used in the search engine 
according to the invention. 

However, it has become increasingly important to cater for a growing number 
of documents to be searched and also to be able to handle an increased traffic 
load, i.e. the number of queries per second which shall be processed by the 



AMENDED SHEET 



16-07-2001 



2 



NO0000155 



search system. This, apart from the ability to handle a large number of 
queries simultaneously on processor level, implies that a search engine 
should be implemented with an architecture that allows for preferably linear 
scalability in two dimensions, viz. both with regard to the data volume and 
5 the performance, i.e. the ability to handle a very large number of queries per 
second. Considering the development of the World Wide Web, a scalability 
problem in the search engine architecture will be extremely important as 
there presently is an enormous growth rate in both the number of documents 
and the number of users on the Internet. 

10 Prior art search engine solutions for Internet are able to scale to a certain 

level, but almost always this is achieved in a manner that requires a high cost 
increase of the search engine system relative to the growth in data volume or 
data traffic. Very often the system costs scale as the square of the data 
volume or the traffic, a doubling of the data volume thus leading to 

15 quadrupled system costs. Furthermore all the major Internet search engines 
presently are based on very expensive server technology, often coupled with 
brute computing force-approaches and accompanied with disadvantages such 
as slow server turnaround, requirements for special hardware to provide fault 
tolerance etc. The system costs can e.g. be measured as the amount of 

20 hardware required to implement a search engine solution or the actual 
aggregated price of the system. 

It is thus a main object of the present invention to provide a search engine 
with a multilevel data and functional parallelism, such that large volumes of 
data can be searched efficiently and very fast by a large number of users 
25 simultaneously. 

Particularly it is a further object of the invention to provide a parallel 
architecture for implementing a search engine with a multilevel data and 
functional parallelism. 

Yet a further object of the present invention is to provide a parallel 
30 architecture which is linearly scalable in two dimensions, i.e. with regard to 
both data volume and performance, that is the query rate. 

The above-mentioned objects and further features and advantages are 
provided with a search engine according to the invention which is 
characterized in that the first set of nodes comprises a dispatch nodes, a 
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second set of nodes comprises b search nodes, a third set of nodes comprises 
g indexing nodes, and an optional fourth set of nodes comprises e acquisition 
nodes, that the dispatch nodes are connected in a multilevel configuration in 
the network, that the search nodes are grouped in v columns which are 
5 connected in parallel in the network between the dispatch nodes and an 

indexing node, that the dispatch nodes are adapted to process search queries 
and search answers, the search queries being dispatched further to all search 
nodes and in case the acquisition nodes are not present, the search answers 
being returned to the dispatch nodes and therein being combined to a final 

10 search result, that the search nodes each are adapted to contain search 
software, that the indexing nodes are adapted for generally generating 
indexes i for the search software and optionally for generating 
partition-dependent data sets d Pfk to search nodes which comprise a search 
processor module, that in case acquisition nodes are present, these are 

15 connected in a multilevel configuration in the network similar to that of the 
dispatch node, and adapted for gathering answers to search queries and 
outputting a final result thereof, thus relieving the dispatch nodes of this task, 
that the number v of search node columns scales with the number n of 
partitions d, thus implementing a scaling of the data volume, and that the 

20 number of search nodes in each search node column scales with an estimated 
or expected traffic load, thus implementing a scaling of performance, 
whereby in any case searching the document collection D takes place with a 
single search node holding one of the partition-dependent data sets d Pik and 
all search nodes in a search node column containing identical 

25 partition-dependent data sets d Ptk . 

According to the invention are advantageously the multilevel configuration 
of the dispatch nodes and the optional acquisition nodes network provided by 
hierarchical tree structures, and the multilevel configuration of the optional 
acquisition nodes is then preferably a mirror image of the multilevel 
30 configuration of the dispatch nodes, the hierarchical tree structures 
preferably being binary tree structures. 

According to the invention each of the search nodes advantageously 
comprises a search software module. 

Further, according to the invention at least some of the search nodes 
35 comprises at least one dedicated search processor module, each dedicated 
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search processor module being realized with one or more dedicated processor 
chips, which is adapted for parallel handling of a number of search queries. 
In this connection it is preferred that the dedicated search processor chips are 
provided in the search processor modules in y processor groups, each with z 
5 search processor chips and being connected with and adapted to receive data 
from a memory assigned to the processor group. 

According to the invention the increase in the number of partitions can 
preferably be accompanied by a corresponding increase in the number of 
dispatch nodes and, in case, also in the number of acquisition nodes, and 
10 optionally also by an increase in the number of index nodes. 

Finally it is according to the invention advantageous that the separate node 
sets each is implemented over one or more workstations connected in a data 
communications network. 

The search engine according to the invention shall now be described in terms 
15 of non-limiting exemplary embodiments and with reference to the 
accompanying drawings, in which 

fig. 1 shows a general overview of the architecture in a first embodiment of 
the search engine according to the invention, 

fig. 2 a general overview of the architecture in a second embodiment of the 
20 search engine according to the invention, 

fig. 3 schematically a search node with search software and dedicated 
hardware search chips, 

fig. 4 the layout of a single module of search chips, 

fig. 5 schematically the query handling in a dedicated search chip, 

25 fig. 6 schematically the principle of performance scaling, 

fig. 7 a multilevel dispatch node configuration based on a binary tree 
structure, 

fig. 8 an overview of the architecture of a first embodiment of the search 
engine according to the invention, with an indication of the principle for 
30 two-dimensional scaling, and 
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fig. 9 an overview of the architecture of a second embodiment of the search 
engine according to the invention, with an indication of the principle for 
two-dimensional scaling. 

Searching a large collection of independent documents is a highly parallel 
5 task. The search engine according to the invention employs parallelism on 
different levels as shall be discussed in the following. 

The search engine according to the present invention searches a document 
collection of documents Z>. The documents can be divided into n partitions 
dj, d 2 , d n . Each document collection D y or partition d of a document 

10 collection can be preprocessed for use in a hardware text filtering system, for 
instance implemented by dedicated hardware like the applicant's so-called 
Pattern Matching Chip (PMC) which is disclosed in the applicant's 
international patent application No. PCT/NO99/00344 which hereby is 
incorporated by reference. The preprocessed document collection is denoted 

15 D p and the corresponding preprocessed document collection partitions d p are 
denoted d p2 , d p2 , d pn . 

Software-based search systems require an index generated from the document 
collection. The index is denoted / and the indexes corresponding to the 
document collection partitions d pU d p2t ... d pn , are denoted i h i 2 ,...i n . 

20 The data set needed for searching a partition d of the document collection D, 
is called the partition-required or partition-dependent data set. In a software 
only system (SW system) the data set is the index i k) while in the systems 
with hardware (SW/HW systems), the data set also includes the preprocessed 
document collection partition d p k with the corresponding index z*, where 1< k 

25 <n. 

The essentially software-implemented partitioning and preprocessing 
operations can be rendered schematically as 

*(D)^*(d Ir ..,d n )~>*(d pJ ,... t d pn )—>d pk , where *(D) denotes a partitioning 
operation on the input D, *(d!,...,d n ) a filtering operation, e.g. indexing, on 
30 d!,...,d n and d Ptk of course is the partition-dependent data set, which in an SW 
system only shall be the index i k , and with 1 < k < n. 

A search engine is implemented on a cluster of workstations that are 
connected using a high performance interconnect bus. The not shown 
workstations then constitute the server of the search system (search server). 
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The workstations implement nodes of the search server. The nodes perform 
different tasks and are according to the invention implemented as set out 
below. 

The nodes can be regarded as virtual nodes distributed among the 
5 workstations, but in a SW/HW search engine the dedicated search processing 
hardware must be physically present in some workstations in order to support 
the hardware-based search nodes. The search node software may then still be 
distributed. Also, some search nodes in SW/HW search engine may comprise 
software only, and optionally be distributed over more than one workstation. 

10 • Dispatch nodes N a i,...N aa handle incoming queries and send queries along 
to all the search nodes. The dispatch nodes can also be configured as 
acquisition nodes, gathering the answers, i.e. the search results to the 
queries. Upon receiving the answers, the dispatch nodes in the acquisition 
mode merge the search results into a final result. 

15 • Search nodes Npj,...,Np b hold a portion of the entire data set d p . A search 
node comprises both the dedicated search software as well as a number of 
the above-mentioned PMC modules for hardware searching. 

• Indexing nodes N Yl ,...,N y5 are responsible for generating indexes for the 
dedicated search software on a number of search nodes. For the PMC 

20 modules, the indexing node N Y also filters and preprocesses the raw data. 

• Acquisition nodes Ns^-.^Nfic may optionally be provided for gathering the 
answers and merging the search results into a final result, in which case 
the dispatch node N a ^ of course, is relieved of the acquisition task. 

A first embodiment of the search engine according to the invention is shown 
25 in fig. 1, where the search nodes Np are provided in columns or groups S 
provided and connected in parallel between the dispatch nodes N a and the 
indexing nodes N Y . The arrangement of the dispatch nodes N a is shown 
schematically, but in practice they would be provided in a multilevel 
hierarchical arrangement. 

30 A second embodiment of the search engine according to the invention is 
shown in fig. 2, where acquisition nodes N§ are provided in a similar 
arrangement as that of the dispatch nodes, relieving the latter of the 
acquisition task. 



AMENDED SHEET 



16-07-2001 



^ NO0000155 

7 



It is to be understood that singular workstations may implement a specific 
type of nodes only, on alternatively more than one type of nodes. In other 
words, the different types of nodes may be distributed over the cluster of 
workstations. Hence the architecture shown in figs. 1 and 2 is implemented 
5 by the whole cluster, and these figures accordingly do neither show the 
workstations nor the interconnect bus. 

The nodes shall now be discussed in more detail, starting with the search 
nodes which are central to the search engine according to the invention. 

A search node Np holds as mentioned a portion of the entire data set d p . The 
10 search node has both a software search engine SW, and optionally a number 
of PMC modules M, as shown in fig. 3. The data set for a search node Np is 
generated on an indexing node N Y , of which more anon. 

A search node may be equipped with a number x of PMC modules M for very 
fast searching, as shown in fig. 3. Each PMC module M has y groups G of z 
15 PMCs, as shown in figure 4, where each group G receives data from a single 
memory chip RAM. These A module M will typically be an individual circuit 
board. Each chip PMC is capable of processing q simultaneous queries, as 
shown schematically in fig. 5. 

A pattern matching chip PMC can process a data volume of t c bytes per 
20 second. Assuming that the memory modules are capable of delivering t y bytes 
per second to the pattern matching chips PMCs, a PMC can search through 
the data volume of T c bytes, T c = min{t c ,t y }t, in the given time f. 

As shown in figure 4, the pattern matching chips PMC are placed in modules 
M with y groups G of z chips PMC, where each group G receives data from a 
25 single memory chip RAM, and the size of the memory chip is T c . The total 
amount of data this module can search through is T y = T c with zq different 
queries. 

When x modules M are provided in a search node Np, these PMC modules M 
can search through an amount of data equal to T r = T y • jc= min{ t Ct t y }txy - 
30 since no PMC modules search through the same data, the number of 
concurrent queries is still zq. 

Thus the total query rate of the PMC modules in a search node can be 
expressed as 
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minfr^ }txy 

where T r denotes the total data volume on a node. The search node 
performance can now be calculated. 

Given that the PMC modules M (or any hardware equivalent) has a query rate 
5 of r HW and that the search software on a search node Np has a query rate of 
r S w* the total query rate r^of a search node N s can be expressed as 

r Z = r t,HW 0 ~ <PSW ) + r t.SW<PsW (2) 

where q> S w denotes the percentage of queries q that will be executed in 
software. The actual value of <p S w is dynamically updated at runtime from a 
10 statistical model. 

The dispatch nodes N a receive all the queries, and resend them to all the 
search nodes Np. The answers from the different search nodes Np are merged 
and in case the dispatch nodes N a functions as acquisition nodes, a complete 
answer is returned. 

15 The indexing nodes N Y collect documents and create prebuilt indexes for the 
search software on the different search nodes Np. Hence the indexing nodes 
N Y can be incorporated in the search nodes Np with appropriate indexing 
software in the latter. The hardware is based on scanning through the entire 
collection of raw data, but some preprocessing and filtering of the raw data 

20 can be done in the indexing nodes N Y . 

Concerning the interconnect and data traffic, some general observations can 
be made based on the following considerations. 

Different types of interconnect can be used for connecting the nodes. For a 
lower end system, a regular 100 Mbit Fast Ethernet will for instance handle 
25 the traffic. 

Traffic on the interconnect between the nodes can be divided into two 
categories: 

• Query traffic - traffic between dispatch nodes N a and search nodes Np. 
This traffic is present all the time when searching is performed. The query 
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traffic is characterized by low to medium data volumes and high 
frequency. 

• Data traffic - traffic between indexing nodes N Y and search nodes Np. The 
data traffic is characterized by high data volumes and low frequency 
5 (typically one batch per day). 

A typical query will transfer a query string from the dispatch node N a to the 
search nodes Np. Then the search nodes Np will reply with a sequence of 
documents matching the query. Optionally N a shall also be able to query the 
search node for the URL strings for the document, but this is considered 
10 immaterial in the present context. 

The architecture of the search engine according to the invention can, based 
on the above considerations, now easily be scaled in two dimensions, viz. the 
data volume and the performance dimension respectively, 

Data volume scaling is achieved by adding more data set partitions d y in other 
15 words more groups or columns S of search nodes Np are added. Also the 
number of indexing nodes N Y and dispatch nodes N a can be increased as 
necessary in order to handle more data set partitions d. 

Performance scaling can be achieved in the search engine architecture by 
replicating data set partitions d with a corresponding increase in the number 
20 of search nodes Np, by increasing the number as illustrated in fig. 6. When 
using replication of data set partitions to scale the system performance, each 
search node Np is part of the search node group S. Thus, the search nodes 

Npi,...,Np,j are arranged into groups Sp l5 ...,Sp v , where v= — , h s denoting the 

scaling factor. The group Spj contains the search nodes Npj,Npj +l , Np J+2 and 
25 Npj + 3 as rendered in fig. 8. 

Scaling the data volume may cause the number of search nodes Np receiving 
queries broadcast from a dispatch node N a growing quite large. The 
architecture solves this problem by using several levels X of dispatch nodes 
N a - this is illustrated in fig. 7, which renders the arrangement of the 
30 dispatch nodes N a as nodes in a portion of a binary data distribution tree. A 
binary data distribution tree easily allows for a linear scalability. Similar 
binary data distribution trees of the kind that already has been disclosed in 
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the applicant's above-mentioned international application PCT/NO99/00344, 
which discloses the configuration of an actual implementation of the pattern 
matching chip PMC. The number of dispatch nodes N a in a regular binary 
tree is, of course 2 X ' X on each level A, X = 1,2,3... A dispatch root node is on 
5 the first level, and up to and including a given level A, there is a total of 2 X -1 
dispatch nodes in the tree. In case the dispatch nodes N a are used also as 
acquisition nodes, i.e. for gathering the answers returned from the search 
nodes, the results of a search is merged in the dispatch nodes, the root 
dispatch node outputting the final answer to the query. However, there is 

10 nothing against that the search engine according to the invention is set up 
with a separate data gathering tree connected to the search nodes and 
comprising acquisition nodes N5 gathering and outputting the final result of a 
query on the acquisition root node of the data gathering tree, i.e. the data 
acquisition node tree. The acquisition node tree could then be a mirror image 

15 of the dispatch node tree. 

The schematic layout of a scalable search engine architecture according to 
the invention is shown in fig. 8, with the principle for two-dimensional 
scaling illustrated. It will be seen that the dispatch nodes N a constitute a 
front end of the search engine and route the queries to the search node Np and 

20 receives the search results back from the search nodes, wherein the actual 

search of the indexed data is performed. In case dedicated acquisition nodes 
Na are used, as shown in fig. 9, which otherwise is similar to fig. 8, the 
search results would, of course, be returned thereto. With the dispatch nodes 
N a arranged in a tree configuration as shown in fig. 9, the acquisition node 

25 network as a back end of the search engine would form a mirror image of the 
dispatch node network. The indexing (spidering) nodes N Y also constitute a 
back end of the search engine and collect data from e.g. the Internet and 
index the data to generate a searchable catalogue. By adding search nodes N p 
or rather search nodes groups S horizontally the search engine scales linearly 

30 in the data volume, each additional search node or search node group 

containing different data. Typical capacity parameters for a search engine can 
as a non-limiting instance be given as follows. One search node Np can 
typically handle 8 000 000 page views per day in a 5 000 000 documents 
catalogue. For a scalable search engine each search node Np typically could 

35 hold 5 000 000 unique indexed documents, implying that 40 search nodes in 
one row are enough to maintain a 200 000 000 documents catalogue. Scaling 
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the performance, i.e. increasing the traffic capacity demands more rows of 
search nodes Np with the same data to be added, such that the search nodes in 
a single column or group S contain identical data. A group or column S of 10 
search nodes Np hence will be able to handle 80 000 000 page views per day, 
5 with 40 columns handling a total of 3 200 000 000 page views per day. 

An additional important benefit of a search engine according to the invention 
with the architecture scalable as herein disclosed, is that the query response 
time is essentially independent of the catalogue size, as each query is 
executed in parallel on all search nodes Np and that the architecture is 
10 inherently fault-tolerant, such that faults in the individual nodes will not 

result in a system breakdown, only temporary reduce the performance until 
the fault is corrected. 

Moreover, the in principle unlimited linear scalability of the data volume and 
the traffic volume which can be provided in a search engine according to the 

15 invention, contrasts sharply with prior art search engines, wherein the search 
cost typically increases exponentially with the data or traffic volume 
increase, and wherein the maximum capacity of the prior art search engines 
typically will be reached at low to moderate volumes. With the search engine 
according to the invention the cost will scale linearly with the increase in 

20 capacity at most, depending actually on whether the capacity increase is 
provided by adding SW search nodes only or also SW/HW search nodes. 
Finally the search engine according to the invention offers the advantage that 
each node in practice can be implemented with standard low cost 
commercially available PCs, but alternatively also with more expensive 

25 UNIX-based servers such as for instance the Sun or Alpha computers as 
currently available. 



30 
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PATENT CLAIMS 

1. A search engine with two-dimensional linearly scalable parallel 
architecture for searching a collection of text documents £>, wherein the 
documents can be divided into a number of partitions d h d 2 ,.*.d ny wherein the 
5 collection of documents D is preprocessed in a text filtration system such that 
a preprocessed document collection D p is obtained and corresponding 
preprocessed partitions d p} , d p2 >... d pn , wherein an index / can be generated 
from the document collection D such that for each previous preprocessed 
partition d pI , d p2 ,... d pn a corresponding index i }j z 2 ,... i n is obtained, wherein 

10 searching a partition d of the document collection D takes place with a 

partition-dependent data set d p k comprising both the preprocessed partition 
d pk and the corresponding index i k , with l<k<n, and wherein 
the search engine comprises data processing units which form sets of nodes 
(N) connected in a network, characterized in that 

15 the first set of nodes comprises a dispatch nodes (N a i, N aa ), a second set 
of nodes comprises b search nodes (Npi,...,Np b ), a third set of nodes 
comprises g indexing nodes (N Yl ,...,N Yg ), and an optional fourth set of nodes 
comprises e acquisition nodes (Nsi,...N5 e ). 

that the dispatch nodes (N a ) are connected in a multilevel configuration in 

20 the network, 

that the search nodes (Np) are grouped in v columns (S) which are connected 
in parallel in the network between the dispatch nodes (N a ) and an indexing 
node (N Y ), that the dispatch nodes (No) are adapted to process search queries 
and search answers, the search queries being dispatched further to all search 

25 nodes (Np), and in case the acquisition nodes (N5) are not present, the search 
answers being returned to the dispatch nodes (N a ) and therein being 
combined to a final search result, 

that the search nodes (Np) each are adapted to contain search software, 
that the indexing nodes (N Y ) are adapted for generally generating indexes i 

30 for the search software and optionally for generating partition-dependent data 
sets d Ptk to search nodes (Np) which comprise a search processor module, 
that in case acquisition nodes (N5) are present, these are connected in a 
multilevel configuration in the network similar to that of the dispatch nodes 
(N a ), and adapted for gathering answers to search queries and outputting a 

35 final result thereof, thus relieving the dispatch nodes of this task, 
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that the number v of search node columns (S) scales with the number n of 
partitions d, thus implementing a scaling of the data volume, and that the 
number of search nodes (Np) in each search node column (Sp) scales with an 
estimated or expected traffic load, thus implementing a scaling of 
5 performance, whereby in any case searching the document collection D takes 
place with a single search node (Np) holding one of the partition-dependent 
data sets d Ptk and all search nodes (Np) in a search node column (Sp) 
containing identical partition-dependent data sets d p k . 

10 2. A search engine according to claim 1, 

characterized in that the multilevel configuration of the dispatch nodes (N a ) 
and the optional acquisition nodes (N5) in the network are provided by 
hierarchical tree structures. 

3. A search engine according to claim 2, 

15 characterized in that multilevel configuration of the optional acquisition 

nodes (N5) is a mirror image of the multilevel configuration of the dispatch 
nodes (N a ). 

4. A search engine according to claim 2, 

characterized in that the hierarchical tree structures are binary tree structures. 

20 5. A search engine according to claim 1, 

characterized in that each of the search nodes (Np) comprises a search 
software module (SW). 

6. A search engine according to claim 5, 

characterized in that at least some of the search nodes (Np) comprises at least 
25 one dedicated search processor module (M), each dedicated search processor 
module (M) being realized with one or more dedicated search processor chips 
(PMC) which each is adapted for parallel handling of a number of q search 
queries. 

7. A search engine according to claim 6, 

30 characterized in that the dedicated search processor chips (PMC) are 

provided in the search processor modules (M) in y processor groups (G), each 
with z search processor chips (PMC) and being connected with and adapted 
to receive data from a memory (RAM) assigned to the processor group (G). 
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8. A search engine according to claim 1, 

characterized in that an increase in the number of partitions d in the scaling 
of the data volume is accompanied by a corresponding increase in the number 
of dispatch nodes (No) and, in case, also in the number of acquisition nodes 
5 (N s ), and optionally also by an increase in the number of index nodes (N y ). 

9. A search engine according to claim 1, 

characterized in that the separate node sets (N a , N p , N Y , N 5 ) each is 
implemented over one or more workstations connected in a data 
communications network. 
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A search engine with two-dimensional linearly scalable parallel 
architecture. 

The present invention concerns a search engine with two-dimensional 
.inea^iy scaiabie parallel architecture for searching a col.econ of tex 
5 documents D. wherein the documents can be divided into a number of 
5 aZns d„ A...*, wherein the collection of documents D "J££Zi 
I a text filtration system such that a processed document col.ec«on D ■ * 
obtained and corresponding preprocessed partitions «„ ,d pl ,. d 
index I can be generated from the document coUecfon D such that for each 
,0 previous preprocessed partition d p „ d p ,... d„ a correspond.ng .ndex ,,, , 
Zs obtained wherein searching a partition d of the document coHea.cn D 
."axes p.ace with a partition-dependen, data set d p . t <°™ ^ *• 

. . i j th( , rorresDonding index i*, with IS k< n, ana 
preprocessed partition ^ and the correspc g ^ ^ 

wherein the search engine comprises data processing un 

15 nodes connected in a network. 

Most prior art search engines work with large data set and employ powerful 
TomXrs to perform the search. However, searching s a partmonabU da* 
processing problem, and this fact can be used to partition a search prob,em 
ZZZl number of specific queries and let each query be process d 

20 illtaneously on a commensurate number of P ro « sso ™f * 

U, in a network. P--**^^ n^is 

^rei;;^^^^ 

r - ^^^^ - — - rr in 

National paten, application PCT— «, ----- ^ 
30 applicant and hereby * h ^ fc« - «■ eng . ne 

on efficient core search algorithms wlucn may o 
according to the invention. 

However it has become increasingly important to cater for a growing number 
However, it nas oei. increased traffic 

of documents to be ^ shall be processed by the 

35 load, i.e. the number of queries per setunu 
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search system. This, apart from the ability to handle a large number of 
queries simultaneously on processor level, implies that a search engine 
should be implemented with an architecture that allows for preferably linear 
scalability in two dimensions, viz. both with regard to the data volume and 
5 the performance, i.e. the ability to handle a very large number of queries per 
second. Considering the development of the World Wide Web, a scalability 
problem in the search engine architecture will be extremely important as 
there presently is an enormous growth rate in both the number of documents 
and the number of users on the Internet. 

10 Prior art search engine solutions for Internet are able to scale to a certain 

level, but almost always this is achieved in a manner that requires a high cost 
increase of the search engine system relative to the growth in data volume or 
data traffic. Very often the system costs scale as the square of the data 
volume or the traffic, a doubling of the data volume thus leading to 

15 quadrupled system costs. Furthermore all the major Internet search engines 
presently are based on very expensive server technology, often coupled with 
brute computing force-approaches and accompanied with disadvantages such 
as slow server turnaround, requirements for special hardware to provide fault 
tolerance etc. The system costs can e.g. be measured as the amount of 

20 hardware required to implement a search engine solution or the actual 
aggregated price of the system. 

It is thus a main object of the present invention to provide a search engine 
with a multilevel data and functional parallelism, such that large volumes of 
data can be searched efficiently and very fast by a large number of users 
25 simultaneously. 

Particularly it is a further object of the invention to provide a parallel 
architecture for implementing a search engine with a multilevel data and 
functional parallelism. 

Yet a further object of the present invention is to provide a parallel 
30 architecture which is linearly scalable in two dimensions, i.e. with regard to 
both data volume and performance, that is the query rate. 

The above-mentioned objects and further features and advantages are 
provided with a search engine according to the invention which is 
characterized in that the first set of nodes comprises a dispatch nodes, a 
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second set of nodes comprises b search nodes, a third set of nodes comprises 
g indexing nodes, and an optional fourth set of nodes comprises e acquisition 
nodes, that the dispatch nodes are connected in a multilevel configuration in 
the network, that the search nodes are grouped in columns which are 
5 connected in parallel in the network between the dispatch nodes and an 

indexing node, that the dispatch nodes are adapted to process search queries 
and search answers, the search queries being dispatched further to all search 
nodes and in case the acquisition nodes are not present, the search answers 
being returned to the dispatch nodes and therein being combined to a final 

10 search result, that the search nodes each are adapted to contain search 

software, that at least some of the search nodes additionally comprise at least 
one search processor module, that the indexing nodes are adapted for 
generally generating indexes / for the search software and optionally for 
generating partition-dependent data sets d pk to search nodes which comprise 

15 a search processor module, that in case acquisition nodes are present, these 
are connected in a multilevel configuration in the network similar to that of 
the dispatch node, and adapted for gathering answers to search queries and 
outputting a final result thereof, thus relieving the dispatch nodes of this task, 
and that the two-dimensional linear scaling respectively takes place by 

20 scaling of the data volume through an increase in the number of partitions d 
and scaling of performance through replication of one or more partitions d. 

According to the invention are advantageously the multilevel configuration 
of the dispatch nodes and the optional acquisition nodes network provided by 
hierarchical tree structures, and the multilevel configuration of the optional 
25 acquisition nodes is then preferably a mirror image of the multilevel 
configuration of the dispatch nodes, the hierarchical tree structures 
preferably being binary tree structures. 

According to the invention each of the search nodes advantageously 
comprises a search software module. 

30 Further, according to the invention at least some of the search nodes 

comprises at least one dedicated search processor module, each dedicated 
search processor module being realized with one or more dedicated processor 
chips, which is adapted for parallel handling of a number of search queries. 
In this connection it is preferred that the dedicated search processor chips are 

35 provided in the search processor modules in v processor groups, each with i 
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search processor chips and being connected with and adapted to receive data 
from a memory assigned to the processor group. 

According to the invention the increase in the number of partitions in the 
scaling of the data volume is advantageously implemented by an increase in 
the number of search node groups or columns. In this connection the increase 
in the number of partitions can preferably be accompanied by a 
corresponding increase in the number of dispatch nodes and. in case, also in 
the number of acquisition nodes, and optionally also by an increase in the 
number of index nodes. 

According to the invention the replication of one or more partition in the 
scaling of performance is advantageously implemented by an increase in the 
number of search nodes in each group or column. 

Finally it is according to the invention advantageous that the separate node 
sets each is implemented over one or more workstations connected in a data 
communications network. 

The search engine according to the invention shall now be described in terms 
of non-limiting exemplary embodiments and with reference to the 
accompanying drawings, in which 

fig. 1 shows a general overview of the architecture in a first embodiment of 
the search engine according to the invention. 

fig. 2 a general overview of the architecture in a second embodiment of the 
search engine according to the invention, 

fig. 3 schematically a search node with search software and dedicated 
hardware search chips, 

fig. 4 the layout of a single module of search chips, 

fig. 5 schematically the query handling in a dedicated search chip, 

fig. 6 schematically the principle of performance scaling, 

fig. 7 a multilevel dispatch node configuration based on a binary tree 
structure. 



PCT/NO00/00155 
WO 00/68834 



fig. 8 an overview of the architecture of a first embodiment of the search 
engine according to the invention, with an indication of the principle for 
two-dimensional scaling, and 

fig. 9 an overview of the architecture of a second embodiment of the search 
5 engine according to the invention, with an indication of the principle for 
two-dimensional scaling. 

Searching a large collection of independent documents is a highly parallel 
task. The search engine according to the invention employs parallelism on 
different levels as shall be discussed in the following. 

10 The search engine according to the present invention searches a document 
collection of documents D. The documents can be divided into n partitions 
d h d 2 , d n . Each document collection D. or partition d of a document 
collection can be preprocessed for use in a hardware text filtering system, for 
instance implemented by dedicated hardware like the applicant's so-called 

15 Pattern Matching Chip (PMC) which is disclosed in the applicant's 

international patent application No. PCT/NO99/00344 which hereby is 
incorporated by reference. The preprocessed document collection is denoted 
D p and the corresponding preprocessed document collection partitions d p are 
denoted d ph d p2 , d pn . 

20 Software-based search systems require an index generated from the document 
collection. The index is denoted / and the indexes corresponding to the 
document collection partitions d ph d p2f ... d pn , are denoted //, /?,.../„. 

The data set needed for searching a partition d of the document collection D, 
is called the partition-required or partition-dependent data set. In a software 
25 only system (SW system) the data set is the index i k , while in the systems 

with hardware (SW/HW systems), the data set also includes the preprocessed 
document collection partition d Pmk with the corresponding index where 1< k 
<n. 

The essentially software-implemented partitioning and preprocessing 
30 operations can be rendered schematically as 

*(D)-**(d L ...,d n )->*(d pI d pn j->d rk , where *(D) denotes a partitioning 

operation on the input ZX *(d t d n ) a filtering operation, e.g. indexing, on 

d|,...,d n and d p . k of course is the partition-dependent data set. which in an SW 
system only shall be the index i k . and with 1 < k<n. 
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A search engine is implemented on a cluster of workstations that are 
connected using a high performance interconnect bus. The not shown 
workstations then constitute the server of the search system (search server). 
The workstations implement nodes of the search server. The nodes perform 
different tasks and are according to the invention implemented as set out 
below. 

The nodes can be regarded as virtual nodes distributed among the 
workstations, but in a SW/HW search engine the dedicated search processing 
hardware must be physically present in some workstations in order to support 
the hardware-based search nodes. The search node software may then still be 
distributed. Also, some search nodes in SW/HW search engine may comprise 
software only, and optionally be distributed over more than one workstation. 

• Dispatch nodes N a! ,...N aa handle incoming queries and send queries along 
to all the search nodes. The dispatch nodes can also be configured as 
acquisition nodes, gathering the answers, i.e. the search results to the 
queries. Upon receiving the answers, the dispatch nodes in the acquisition 
mode merge the search results into a final result. 

• Search nodes N P i,...,N pb hold a portion of the entire data set d p . A search 
node comprises both the dedicated search software as well as a number of 
the above-mentioned PMC modules for hardware searching. 

• Indexing nodes N y! ,... ? N yg are responsible for generating indexes for the 
dedicated search software on a number of search nodes. For the PMC 
modules, the indexing node N r also filters and preprocesses the raw data. 

• Acquisition nodes N 6I? ...,N 6e may optionally be provided for gathering the 
answers and merging the search results into a final result, in which case 
the dispatch node N a . of course, is relieved of the acquisition task. 

A first embodiment of the search engine according to the invention is shown 
in fig. 1 ? where the search nodes N p are provided in columns or groups S 
provided and connected in parallel between the dispatch nodes N a and the 
indexing nodes N y . The arrangement of the dispatch nodes N a is shown 
schematically, but in practice they would be provided in a multilevel 
hierarchical arrangement. 
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A second embodiment of the search engine according to the invention is 
shown in fig. 2. where acquisition nodes N$ are provided in a similar 
arrangement as that of the dispatch nodes, relieving the latter of the 
acquisition task. 

5 It is to be understood that singular workstations may implement a specific 
type of nodes only, on alternatively more than one type of nodes. In other 
words, the different types of nodes may be distributed over the cluster of 
workstations. Hence the architecture shown in figs. 1 and 2 is implemented 
by the whole cluster, and these figures accordingly do neither show the 
10 workstations nor the interconnect bus. 

The nodes shall now be discussed in more detail, starting with the search 
nodes which are central to the search engine according to the invention. 

A search node Np holds as mentioned a portion of the entire data set d p . The 
search node has both a software search engine SW, and optionally a number 
15 of PMC modules M, as shown in fig. 3. The data set for a search node Np is 
generated on an indexing node N r of w hich more anon. 

A search node may be equipped with a number x of PMC modules M for very 
fast searching, as shown in fig. 3. Each PMC module M has v groups G of z 
PMCs, as shown in figure 4, where each group G receives data from a single 
20 memory chip RAM. These A module M will typically be an individual circuit 
board. Each chip PMC is capable of processing q simultaneous queries, as 
shown schematically in fig. 5. 

A pattern matching chip PMC can process a data volume of i c bytes per 
second. Assuming that the memory modules are capable of delivering t v bytes 
25 per second to the pattern matching chips PMCs, a PMC can search through 
the data volume of T c bytes, T c = min{/ f ,/y}/, in the given time t. 

As shown in figure 4. the pattern matching chips PMC are placed in modules 
M with v groups G of z chips PMC. where each group G receives data from a 
single memory chip RAM. and the size of the memory chip is T c . The total 
30 amount of data this module can search through is T v = T c with zq different 
queries. 

When x modules M are provided in a search node N (5 . these PMC modules M 
can search through an amount of data equal to T r = T v • .v= min{ t c t v )txy — 
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since no PMC modules search through the same data, the number of 
concurrent queries is still zq. 

Thus the total query rate of the PMC modules in a search node can be 
expressed as 

5 r HW f (1) 

fr 

m\n{t Cy iy)txy 

where T r denotes the total data volume on a node. The search node 
performance can now be calculated. 

Given that the PMC modules M (or any hardware equivalent) has a query rate 
of r HW and that the search software on a search node Np has a query rate of 
10 r S jy 9 the total query rate rrof a search node N s can be expressed as 

h = r <Mw 0 ~ <Psn- ) + r <.s»><Psv (2) 

where <p S w denotes the percentage of queries q that will be executed in 
software. The actual value of q>$w is dynamically updated at runtime from a 
statistical model. 

15 The dispatch nodes N a receive all the queries, and resend them to all the 

search nodes Np. The answers from the different search nodes Np are merged 
and in case the dispatch nodes N a functions as acquisition nodes, a complete 
answer is returned. 

The indexing nodes N r collect documents and create prebuilt indexes for the 
20 search software on the different search nodes Np. Hence the indexing nodes 
N r can be incorporated in the search nodes Np with appropriate indexing 
software in the latter. The hardware is based on scanning through the entire 
collection of raw data, but some preprocessing and filtering of the raw data 
can be done in the indexing nodes N r 

25 Concerning the interconnect and data traffic, some general observations can 
be made based on the following considerations. 

Different types of interconnect can be used for connecting the nodes. For a 
lower end system, a regular 100 Mbit Fast Ethernet will for instance handle 
the traffic. 
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Traffic on the interconnect between the nodes can be divided into two 
categories: 

• Query traffic - traffic between dispatch nodes N„ and search nodes Np. 
This traffic is present all the time when searching is performed. The query 

5 traffic is characterized by low to medium data volumes and high 

frequency. 

• Data traffic - traffic between indexing nodes N-, and search nodes Np. The 
data traffic is characterized by high data volumes and low frequency 
(typically one batch per day). 

10 A typical query will transfer a query string from the dispatch node N a to the 
search nodes Np. Then the search nodes Np will reply with a sequence of 
documents matching the query. Optionally N a shall also be able to query the 
search node for the URL strings for the document, but this is considered 
immaterial in the present context. 

15 The architecture of the search engine according to the invention can, based 

on the above considerations, now easily be scaled in two dimensions, viz. the 
data volume and the performance dimension respectively, 

Data volume scaling is achieved by adding more data set partitions d y in other 
words more groups or columns S of search nodes Np are added. Also the 
20 number of indexing nodes N y and dispatch nodes N a can be increased as 
necessary in order to handle more data set partitions d. 

Performance scaling can be achieved in the search engine architecture by 
replicating data set partitions d with a corresponding increase in the number 
of search nodes Np, by increasing the number as illustrated in fig. 6. When 
25 using replication of data set partitions to scale the system performance, each 
search node Np is part of the search node group S. Thus, the search nodes 

Npi,...,Np, are arranged into groups Spi,....Sp v , where v = — . h s denoting the 

V 

scaling factor. The group Spj contains the search nodes Npj.Npj+i, Np 7+2 a °d 
Npj+3 as rendered in fig. 8. 

30 Scaling the data volume may cause the number of search nodes Np receiving 
queries broadcast from a dispatch node N a growing quite large. The 
architecture solves this problem by using several levels X of dispatch nodes 
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N a - this is illustrated in fig. 7. which renders the arrangement of the 
dispatch nodes N a as nodes in a portion of a binary data distribution tree. A 
binary data distribution tree easily allows for a linear scalability. Similar 
binary data distribution trees of the kind that already has been disclosed in 
the applicant's above-mentioned international application PCT/NO99/00344, 
which discloses the configuration of an actual implementation of the pattern 
matching chip PMC. The number of dispatch nodes N a in a regular binary 
tree is, of course 2 k ' ] on each level X, X = L2,3... A dispatch root node is on 
the first level, and up to and including a given level X there is a total of 2*-l 
dispatch nodes in the tree. In case the dispatch nodes N a are used also as 
acquisition nodes, i.e. for gathering the answers returned from the search 
nodes, the results of a search is merged in the dispatch nodes, the root 
dispatch node outputting the final answer to the query. However, there is 
nothing against that the search engine according to the invention is set up 
with a separate data gathering tree connected to the search nodes and 
comprising acquisition nodes N 6 gathering and outputting the final result of a 
query on the acquisition root node of the data gathering tree. i.e. the data 
acquisition node tree. The acquisition node tree could then be a mirror image 
of the dispatch node tree. 

The schematic layout of a scalable search engine architecture according to 
the invention is shown in fig. 8, with the principle for two-dimensional 
scaling illustrated. It will be seen that the dispatch nodes N a constitute a 
front end of the search engine and route the queries to the search node N p and 
receives the search results back from the search nodes, wherein the actual 
search of the indexed data is performed. In case dedicated acquisition nodes 
N 5 are used, as shown in fig. 9, which otherwise is similar to fig. 8, the 
search results would, of course, be returned thereto. With the dispatch nodes 
N a arranged in a tree configuration as shown in fig. 9, the acquisition node 
network as a back end of the search engine would form a mirror image of the 
dispatch node network. The indexing (spidering) nodes N y also constitute a 
back end of the search engine and collect data from e.g. the Internet and 
index the data to generate a searchable catalogue. By adding search nodes N p 
or rather search nodes groups S horizontally the search engine scales linearly 
in the data volume, each additional search node or search node group 
containing different data. Typical capacity parameters for a search engine can 
as a non-limiting instance be given as follows. One search node N B can 



• # 
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typically handle 8 000 000 page views per day in a 5 000 000 documents 
catalogue. For a scalable search engine each search node Np typically could 
hold 5 000 000 unique indexed documents, implying that 40 search nodes in 
one row are enough to maintain a 200 000 000 documents catalogue. Scaling 
5 the performance, i.e. increasing the traffic capacity demands more rows of 

search nodes N p with the same data to be added, such that the search nodes in 
a single column or group S contain identical data. A group or column S of 10 
search nodes N p hence will be able to handle 80 000 000 page views per day, 
with 40 columns handling a total of 3 200 000 000 page views per day. 

10 An additional important benefit of a search engine according to the invention 
with the architecture scalable as herein disclosed, is that the query response 
time is essentially independent of the catalogue size, as each query is 
executed in parallel on all search nodes Np and that the architecture is 
inherently fault tolerant, such that faults in the individual nodes will not 

15 result in a system breakdown, only temporary reduce the performance until 
the fault is corrected. 

Moreover, the in principle unlimited linear scalability of the data volume and 
the traffic volume which can be provided in a search engine according to the 
invention, contrasts sharply with prior art search engines, wherein the search 

20 cost typically increases exponentially with the data or traffic volume 

increase, and wherein the maximum capacity of the prior art search engines 
typically will be reached at low to moderate volumes. With the search engine 
according to the invention the cost will scale linearly with the increase in 
capacity at most, depending actually on whether the capacity increase is 

25 provided by adding SW search nodes only or also SW/HW search nodes. 

Finally the search engine according to the invention offers the advantage that 
each node in practice can be implemented with standard low cost 
commercially available PCs, but alternatively also with more expensive 
UNIX-based servers such as for instance the Sun or Alpha computers as 

30 currently available. 
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PATENT CLAIMS 

1 . A search engine with two-dimensional linearly scalable parallel 
architecture for searching a collection of text documents D. wherein the 
documents can be divided into a number of partitions d h d 2 ....d n , wherein the 
collection of documents D is preprocessed in a text filtration system such that 
a preprocessed document collection D p is obtained and corresponding 
preprocessed partitions d ph d p2 >... d pn , wherein an index / can be generated 
from the document collection D such that for each previous preprocessed 
partition d ph d p2t ... d pn a corresponding index //, /?,... /„ is obtained, wherein 
searching a partition d of the document collection D takes place with a 
partition-dependent data set d p k comprising both the preprocessed partition 
d pk and the corresponding index with l<k<n, and wherein 
the search engine comprises data processing units which form sets of nodes 
(N) connected in a network, characterized in that 

the first set of nodes comprises a dispatch nodes (N aI , .... N aa ), a second set 
of nodes comprises b search nodes (N P i,...,N pb ), a third set of nodes 
comprises g indexing nodes (N yl ,...,N yg ), and an optional fourth set of nodes 
comprises e acquisition nodes (Nai,...N 5e ). 

that the dispatch nodes (N a ) are connected in a multilevel configuration in 
the network, 

that the search nodes (N p ) are grouped in columns (S) which are connected in 
parallel in the network between the dispatch nodes (N a ) and an indexing node 
(N y ), that the dispatch nodes (N a ) are adapted to process search queries and 
search answers, the search queries being dispatched further to all search 
nodes (N p ), and in case the acquisition nodes (N 6 ) are not present, the search 
answers being returned to the dispatch nodes (N a ) and therein being 
combined to a final search result, 

that the search nodes (N p ) each are adapted to contain search software, 

that at least some of the search nodes (Np) additionally comprise at least one 

search processor module (M), 

that the indexing nodes (N y ) are adapted for generally generating indexes i 
for the search software and optionally for generating partition-dependent data 
sets d pk to search nodes (N p ) which comprise a search processor module, 
that in case acquisition nodes (N 6 ) are present, these are connected in a 
multilevel configuration in the network similar to that of the dispatch nodes 



WO 00/68834 



13 



PCT/NO00/00155 



(N a ), and adapted for gathering answers to search queries and outputting a 
final result thereof, thus relieving the dispatch nodes of this task, and 
that the two-dimensional linear scaling respectively takes place by scaling of 
the data volume through an increase in the number of partitions d and scaling 
5 of performance through replication of one or more partitions d. 

2. A search engine according to claim L 

characterized in that the multilevel configuration of the dispatch nodes (N a ) 
and the optional acquisition nodes (N 6 ) in the network are provided by 
hierarchical tree structures. 

10 3. A search engine according to claim 2. 

characterized in that multilevel configuration of the optional acquisition 
nodes (N 5 ) is a mirror image of the multilevel configuration of the dispatch 
nodes (N a ). 

4. A search engine according to claim 2, 

15 characterized in that the hierarchical tree structures are binary tree structures. 

5. A search engine according to claim L 

characterized in that each of the search nodes (Np) comprises a search 
software module (SW). 

6. A search engine according to claim 5, 

20 characterized in that at least some of the search nodes (N p ) comprises at least 
one dedicated search processor module (M), each dedicated search processor 
module (M) being realized with one or more dedicated search processor chips 
(PMC) which each is adapted for parallel handling of a number of q search 
queries. 

25 7. A search engine according to claim 6, 

characterized in that the dedicated search processor chips (PMC) are 
provided in the search processor modules (M) in v processor groups (G), each 
with z search processor chips (PMC) and being connected with and adapted 
to receive data from a memory (RAM) assigned to the processor group (G). 

30 8. A search engine according to claim 1, characterized in that the 

increase in the number of partitions d in the scaling of the data volume is 
implemented by an increase in the number of search node groups or columns 
(S). 
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9. A search engine according to claim 8. 

characterized in that an increase in the number of partitions d in the scaling 
of the data volume is accompanied by a corresponding increase in the number 
of dispatch nodes (N a ) and. in case, also in the number of acquisition nodes 
5 (N 5 ), and optionally also by an increase in the number of index nodes (N y ). 

10. A search engine according to claim 1. 

characterized in that the replication of one or more partitions d in the scaling 
of the performance is implemented by an increase in the number of search 
nodes (Np) in each group or column (S). 

10 11. A search engine according to claim 1 ? 

characterized in that the separate node sets (N a , N p , N y , N 6 ) each is 
implemented over one or more workstations connected in a data 
communications network. 
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